Audio-Based Semantic Concept Classification for Consumer Video
نویسندگان
چکیده
منابع مشابه
Rule-Based Semantic Concept Classification from Large-Scale Video Collections
The explosive growth and increasing complexity of the multimedia data have created a high demand of multimedia services and applications in various areas so that people can access and distribute the data easily. Unfortunately, traditional keyword-based information retrieval is no longer suitable. Instead, multimedia data mining and content-based multimedia information retrieval have become the ...
متن کاملA Novel Fusion Method for Semantic Concept Classification in Video
Semantic concept classification is a critical task for content-based video retrieval. Traditional methods of machine learning focus on increasing the accuracy of classifiers or models, and face the problems of inducing new data errors and algorithm complexity. Recent researches show that fusion strategies of ensemble learning have appeared promising for improving the classification performance,...
متن کاملRobust audio-based classification of video genre
Video genre classification is a challenging task in a global context of fast growing video collections available on the Internet. This paper presents a new method for video genre identification by audio analysis. Our approach relies on the combination of low and high level audio features. We investigate the discriminative capacity of features related to acoustic instability, speaker interactivi...
متن کاملDetecting audio events for semantic video search
This paper describes our work on audio event detection, one of our tasks in the European project VIDIVIDEO. Preliminary experiments with a small corpus of sound effects have shown the potential of this type of corpus for training purposes. This paper describes our experiments with SVM classifiers, and different features, using a 290-hour corpus of sound effects, which allowed us to build detect...
متن کاملFactor analysis for audio-based video genre classification
Statistical classifiers operate on features that generally include both useful and useless information. These two types of information are difficult to separate in the feature domain. Recently, a new paradigm based on a Latent Factor Analysis (LFA) proposed a model decomposition into usefull and useless components. This method was successfully applied to speaker and language recognition tasks. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Audio, Speech, and Language Processing
سال: 2010
ISSN: 1558-7916,1558-7924
DOI: 10.1109/tasl.2009.2034776